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Abstract —Innovation may be limited by the scarceness of 
resources, such as financial, homogeneous area, skilled 
labor or other research needs, for example the difficulty in 
experimental control of large areas in the field. In 
research areas such as chemistry and physics, designs are 
used in such a way that when compared to the agronomic 
designs, they result in a reduced number of experimental 
units, which in this work are called economic designs. 
Thus, the objective of this study was to identify significant 
factors and effects (p-value<0.05) with the application of 
economic factorial plans for agricultural experiments in 
the field. For this, two studies were carried out, the first 
comparing the 3 k experimental designs with the full 
factorial, using the same number of experimental units for 
both, in order to evaluate the regression model. In the 
second study, the three experimental designs were 
compared, maintaining the number of repetitions and 
evaluating the influence of the reduction of the number of 
experimental units in the approach of the regression 
models. At the end of the two experiments, it was observed 
that the 3 k design was able to identify the same effects and 
response surface similar to the control with a significant 
reduction (64) in the number of experimental units. 
Keywords — Response surface methodology, Innovation, 
Process efficiency, Economy. 

I. INTRODUCTION 

The Response Surface consists of a set of mathematical 
and statistical methods used in the study of the relation 
between factors and responses [1,2,3], that may be the 
result of a first-order polynomial or more complex 
interactions with polynomials of a higher order [4], this 
relation may be stated by equation (1): 

y = f(xi,x 2 )+e (1) 

where y is the answer, because of the variables xi and x 2 , 
added to the experimental error e [5,6]. 

The designs of the Rotational Central Composite Design - 
RCCD and 3 k are second order designs, that is, the 
regression equation that describes the behavior of the 
variables also presents the quadratic terms and has the 
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characteristic of repeatability in the central point, are 
flexible and require smaller numbers of experimental units 
[7,8] compared to the full factorial applied in the 
investigation the effect of many treatments. For example, a 
full factorial with two factors, each factor with five levels, 
contemplates twenty-five treatments repeated four times, 
which results in one hundred experimental units. However 
in 3\ in the same analysis we would have nine treatments 
repeated four times. When observing the Rotational 
Central Composite Design RCCD, the addition of the 
vertices (+ 1.41) that form the star increases the coding 
matrix for eleven treatments repeated four times [9]. Both 
the 3 k and RCCD are members of DOE (Design of 
Experiments) that includes an important framework of 
designs that support scientific findings [11]. 

The major difficulty is to adapt the economic factorials (3 k 
and RCCD) for agronomic experimentation, observing the 
basic principles of agricultural experimentation [10,12], 
which is characterized as a different scenario when 
compared with controlled environment research, 
presenting a low coefficient of variation (less than 3%) 
[13], observed the sensitivity of the statistical tests in the 
higher variation coefficients (higher 6%) [14], influenced 
by the uncontrolled factors for the field experiments. 

Thus, this study aimed to identify significant factors and 
effects (p-value< 0.05) through the application of economic 
factorial planning and response surface for field 
experiments. The studies show that for both, linear and 
nonlinear models, it was possible to identify the same 
significant effects reducing experimental units. 

H. EXPERIMENTAL PROCEDURES 

The research was divided into two studies, the first aims to 
compare the regression models generated by 3 k and the 
control (full factorial), taking into account the maintenance 
of the same number of experimental units and repetitions. 
In the second, the regression models generated by 3 k and 
RCCD were compared with the control, maintaining the 
number of repetitions, but reducing the number of 
experimental units. 
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The data from study I are an integral part of 
Zimmermann's [15] research and consist of a factorial 3x3, 
whose factors were factor 1 , soil density ( 1 . 12 , 1.26 and 
1.39 g.cm" 3 ) and factor 2, doses of a nutritional compound 
called "FTE-BR12 microelement" (0, 1, 2g) for the rice 
dry mass response variable in logarithm RBD 
(Randomized Blocks Design) was used to allocate the field 
experiment that resulted in nine treatments, repeated three 
times. 


The study II consisted of an experiment organized in a full 
factorial (5x5), in RBD with four repetitions of treatments, 
being the factor 1 , the days after emergence (dae) of corn, 
factor 2 , the days after the application (daa) of corn 
defensive agent and the variable response to leaf width of 
the corncob in millimeters [16]. 

The data were coded per Montgomery methodology [4], 
presented in Table 1, resulting in the different number of 
treatments in the full factorial, 3 K and RCCD. 


Table.1: Encoding matrix, with treatment numbers for: 5x5 full factorial (two factors with five levels each), 3 2 (tw ! o factors 
with three levels each) and Rotational Central Composite Design RCCD (two factors with three levels + vertex), without 

repetition. 



-full factorial- 

- 3 : 

l 

-RCCD- 


treatment 

code 

factor A 

code 

factor B 

code 

factor A 

code 

factor B 

code 

factor A 

code 

factor B 

experimental 

responses 

1 

-2 

-2 

-1 

-1 

-1 

-1 

Y1 

2 

-1 

-2 

0 

-1 

+1 

+ 1 

Y2 

3 

0 

-2 

+1 

-1 

+1 

-1 

Y3 

4 

+ 1 

-2 

-1 

0 

-1 

+ 1 

Y4 

5 

+2 

-2 

0 

0 

0 

0 

Y5 

6 

-2 

-1 

+1 

0 

0 

0 

Y 6 

7 

-1 

-1 

-1 

+ 1 

0 

0 

Y7 

8 

0 

-1 

0 

+1 

-1,41 

0 

Y 8 

9 

+ 1 

-1 

+1 

+1 

0 

1,41 

Y9 

10 

+2 

-1 



0 

-1,41 

Y10 

11 

-2 

0 



1,41 

0 

Yll 

12 

-1 

0 





Y12 

13 

0 

0 





Y13 

14 

+ 1 

0 





Y14 

15 

+2 

0 





Y15 

16 

-2 

+1 





Y16 

17 

-1 

+1 





Y17 

18 

0 

+1 





Y18 

19 

+ 1 

+1 





Y19 

20 

+2 

+1 





Y20 

21 

-2 

+2 





Y21 

22 

-1 

+2 





Y22 

23 

0 

+2 





Y23 

24 

+ 1 

+2 





Y24 

25 

+2 

+2 





Y25 


Source: Adapted from Montgomery (2016) 


Considering the full factorial for the two experiments, a 
general model of variance analysis was fitted, equation 
( 2 ): 

y = (3 0 + b + (3iXi+ p 2 xj + Pnx, 2 + p 22 xj 2 + PnXjXj + e (2) 
where y is the estimated response, Po is intercept, b is the 
block. Pi is the linear coefficient of factor 1 , x, is the 
normal distribution of waste (Shapiro-Wilk and 
Kolmosgorov-Simirnov test), block additivity (Tukey test) 
and confirmed by visual inspection of residues. 


variable 1 in level ;, p 2 is the linear coefficient of factor 2 
Xj is the variable 2 in level j, Pn is the quadratic 
coefficient of factor 1 , P 22 Is the quadratic coefficient of 
factor 2 , Pi 2 x;Xj is the interaction between factors 1 and 2 , 
e is the experimental error, with the factors considered 
qualitative, were observed the assumptions of variance 
homogeneity (Bartlett and Levene tests) 
The regression models were adjusted (quantitative 
variables) after the validation of the F test, starting from 
the most complete model and subtracting terms, tested the 
significance ( p-valor< 0,05) of the coefficients (t-test). 
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In the selection of the most adequate model for describing 
the response, it was observed: 

1. The absence of non-significant coefficients (p- 
value> 0.05) in all the regression models tested; 

2. Analysis of variance (ANOVA) for the 
hypotheses: Ho: the models are the same; the 
regression models did not differ significantly ( p- 
value< 0.05); Hi: the regression models differ 
significantly; 

3. Comparison of the Rousted, the regression model 
being the best fit, which presented the highest 

Readjusted, 

4. Comparison of the Akaike Information Criterion 
(AIC), the best fit regression model, which 
presented the lowest AIC [17]. 

5. In the absence of differences between the models, 
the “law of parsimony” was observed. 

The AIC was chosen as an adjustment test because it 
penalizes the lack of adjustment and complex models, 
being in agreement with the law of parsimony [18, 19]. 
Comparison between the coefficients of determination (R 2 ) 
were not accomplished, since this is influenced by the 
number of terms of the regression model, according to 
Adair and Silva [20], the withdrawal of terms increases the 
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sum of the squares and, consequently, there is an increase 
in R 2 . 

In the development of 3 k , according to Montgomery’s 
methodology [4], the data were coded (-1, 0 and +1) and 
for RCCD, according to the same author's methodology, 
the vertices were added (+ 1,41). We observed the 
assumptions for both, tested the need to work with pure 
error (ep) (ANOVA) and repeated the process of selecting 
the best-fit model, as described in the full factorial. 

The pure error (ep) does not have correlation with the 
model [ 21 , 22 ], therefore, it does not depend on the 
estimative responses (y), reflecting only the dispersion at 
each factor level of the repeated responses (y) around the 
mean (y), calculated by X(y-y) 2 , summation of the squared 
differences between the original response (y) and its mean 
(y), which results in the estimation of the variance for the 
model, influencing the values of the F test, whether the 
model is adjusted or not [23]. 

Since methodologies are from different areas, each 
factorial presents different particularities. The R software 
was the program chosen for this study because it presents 
an immense range of packages [24]. For didactic purposes, 
the functions and packages used in this study are detailed 
in the mentioned sources in Table 2. 


Table 2 - Packages, functions, applications and sources for the development of full factorial, 3 k and Rotational Central 

Composite Design -RCCD in software R, used in this study. 


packages 

functions 

applications 

source of the package: 

R-base 

AIC () 

Akaike Information Criterion 

R Core Team (2018)[24] 


aov() 

Analysis of variance (ANOVA) 



bardlet.test() 

Homogeneity of variance test 



boxplot() 

BoxPlot 



hist() 

Histogram 



ks.teste 

Kolmogorov-Simirnov test 



lm() 

Regression analysis 



plot() 

Builds graphics 



qqplot( ) 

Accumulated distribution chart 



shapiro.test() 

Normal distribution of waste 



s.test() 

Normal distribution of waste 



summary() 

Coefficients, ANOVA, coefficients of variation 


MASS 

boxcox() 

Tests the need of data transformation 

Venables; Ripley (2002)[30] 

rsm 

persp () 

Builds response surface 

Lenth (2013)[31] 


PurreError() 

Pure error 



SO() 

Quadratic models or second order 



Varfcn() 

Contour graphs 


openxlsx 

read.xlsx() 

Reading data straight from excel 

Walker (2015)[32] 

car 

leveneTest() 

Levene test 

Fox; Weisberg (2011)[33] 

dae 

tukey 1() 

Block additivity test 

Brien (2014) [34] 

ExpDes.pt 

fat2.dbc() 

Analysis of qualitative and quantitative variance 

Ferreira, et al (2013)[35] 

Source: Self-elaboration 


III. RESULTS AND DISCUSSIONS Kolmogorov-Simirnov for both, Tukey for block additivity 

In study I, verification of the assumptions for the full and pure error test only in 3 k ) revealed higher p-value 

factorial and 3 k (Bartlett, Levene, Shapiro-Wilk, scores that the significance (a <0.05), therefore, there is 
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not evidence to reject the hypothesis Ho, satisfying the 
assumptions of homogeneity of variance and normal 
distribution of residues. For the full factorial, the effect of 
the blocks was not significant. In 3 k it was observed that it 
is not necessary to work with pure error. All assumptions 
were confirmed by visual inspections. 

For the two designs tested, the linear regression effect was 
significant ( p-value < 0.05), in agreement with ANOVA 
for the regression coefficients (t test), in which only the 
coefficient of linear density soil factor with significant 
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effect on the response variable, however, to exclude the 
other terms from the model, there is a need for adjustment 
tests in order to eliminate the possibility of a collinearity 
effect, since a variable may not have a significant effect in 
isolation, but may influence the total effect of the model 
and its exclusion would make the model less explanatory 
or less adjusted [20]. 

In ANOVA between the models it was observed that there 
is no evidence against Ho, therefore, the models do not 
differ significantly (Table 3) 


Table.3: Regression equations, ANOVA for regression models, and fit test: coefficient of determination (R 2 ) and R 2 adjusted 
Akaike's Information Criterion (AIC) for experiment I, coefficients of variation of 6.36%. 


Model 

Regression equation of the proposed models 

DF 

p-value 

R 2 

p2 . 

ajusted 

AIC 


-Pull factorial- 






1 

y = 3,331+0,335xi+0,061x J -0,085x 2 I -0,042x 2 j -0,009x,Xj 

21 

0,720 

0,256 

0,078 

7,574 

2 

y = 3,346 +0,335xi -0,024 Xj -0,085x 2 i -0,009 2 XiXj 

22 

0,696 

0,250 

0,157 

12,10 

3 

y = 3,374+0,164xi -0,024xj -0,0092XiXj 

23 

0,438 

0,228 

0,107 

9,093 

4 

y = 3,384+0,154x1-0,034xj 

24 

0,897 

0,227 

0,163 

7,114 

5 

y = 3,384+0,154x1 

25 

0,589 

0,217 

0,185 

5,483 

1 

y = 3,590+0,154x1+0,0341x J -0,085x 2 1 -0,045x 2 J -0,009x, Xj 

22 

0,681 

0,255 

0,120 

10,12 

2 

y = 3,56 1 +0, 1 54x1+0,034 1 Xj-0,085x 2 i 

23 

0,688 

0,249 

0,152 

8,328 

3 

y = 3,533+0,154xj+0,0341xj-0,042x 2 j 

23 

0,688 

0,233 

0,133 

8,918 

4 

y = 3,504+0,154x1+0,0341Xj 

24 

0,580 

0,227 

0,163 

7,114 

5 

y = 3,504+0,154x1 

25 

0,427 

0,217 

0,185 

5,483 


Where: x, variables of factor 1 at level i; Xj variables of factor 2 at level j; DF are the degrees of freedom; p-value is the is the 
probability of F ca icuiated, significant (p<0.05); R 2 is the coefficient of determination and R 2 a dj US ted is the is the coefficient of 
determination adjusted both for regression; AIC is Akaike's Information Criterion; Source: Self-elaboration. 


Confirming the model selection (Table 3), the highest 
Readjusted and lowest AIC belong to Models 5. The 3 k 
identified the same factor and significant effects when 
compared to the control (full factorial), however with the 
results estimated by 3 k model were 2.91% higher. The 
comparison between the models revealed homogeneous 
variances (0.01654) and the difference between the 
residues quantified by R 2 = 0.9623. 

This result is agreement with Konishi and Kitagawa [25], 
models with small variability fit well the reduction of 
experimental units. But to validate an experiment subject 
to variations of the environment a greater number of 
experimental units and repetitions is recommended, 
minimizing the experimental error [15, 22, 23], and, the 
Surface Response Methodology is an efficient tool to 
optimize the properties of processed foods [26]. Using 
mathematical and statistical techniques, experimental 
results indicate a combination of factor levels within an 
optimal region [4]. 

Study II observed regression models with more complex 
interactions. In the analysis of variance (ANOVA), 
performed for the control (full factorial), composed of one 
hundred experimental units, the interaction between the 


(qualitative) factors was significant (p-value<0.05), 
therefore, interaction was observed in the Bartlett and 
Levene test, both with p-value>0.05, given the assumption 
of homogeneity of variance. Shapiro-Wilk and 
Kolmogorov-Simirnov indicated the normal distribution of 
waste and the Tukey test confirmed that the blocks were 
not additive to the model. 

In the development of 3 k , thirty-six experimental units 
were used. The assumptions were met, but also it was 
identified the need to work with pure error. In RCCD, 
composed of forty-four experimental units, the 
assumptions were not met and the model also indicated the 
need to work with pure error. In all the designs the effect 
of quadratic regression was significant (p-value<0.05). 

All regression models tested (Table 4) showed significant 
coefficients (t-test, with normal error for full factorial and 
t-test with pure error for 3 k and RCCD). According to 
Faraway [27], the identification of the need for pure error 
in the t-test, refers to the option with the lowest variance 
and, therefore, increases the accuracy of the test. 

The ANOVA (analysis of variance) between the analyzed 
regression models revealed that they differed significantly 
(p-value< 0.05) (Table 4). 
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Table.4: Regression models analyzed, regression equations, ANOVA (analysis of variance) for regression models (DF and p- 
value), and fit test: Coefficient of determination (R 2 ) and R 2 adjusted, Akaike Information Criterion (AIC) for Experiment II, 

coefficients of variation of 1.8%. 


Model 

Regression equation of the proposed models 

DF 

p-value 

R 2 

d 2 . 
ajusted 

AIC 


-p u ll Factorial- 






1 

y= 81,543+12,861 x;+13,374xj-2,319xr-2,109Xj 2 -4,26x,Xj 

94 

2.4e-14 

0,982 

0.981 

564.9 

2 

y= 77,325+12,861 Xi+13,374xj-2,319xr-4,266xiXj 

95 

2.4e-14 

0,966 

0,965 

625,0 

3 

y= 72,686+12,86 lx i +13,374x i -4,266x 1 xj 

96 

3.1e-16 

0,947 

0,946 

667,5 

4 

y= 72,686+12,861xi+13,374xj 

97 

2.2e-16 

0,857 

0,854 

766,0 

I 

-J - 

y= 82,976+26,179xi+24,710xj-6,889xr-10,689xf -15,74x, Xj 

33 

1.5 le-1 

0,980 

0.9775 

223,7 

2 

y= 71,356+26,179xi+24,710x J -15,7406x,x J 

32 

89e-14 

0,945 

0,9407 

256,9 

3 

y= 71,2579+26,179xi+24,710xj 

30 

1.84e-7 

0,838 

0,8291 

294,1 


-RCCD- 






I 

y= 90,350+20,29 lxi+18,537xj-9,707xi 2 -7,402xj 2 -15,74xiXj 

38 

5.05e-7 

0,884 

0.8737 

335.7 

2 

y= 77,906+20,291 Xi+18,537xj-15,74x lXj 

40 

5.05e-5 

0,812 

0,7981 

354,6 

3 

y= 77,908+20,29 lx;+18,537xj 

41 

2.6e-07 

0,697 

0,6830 

373,7 


Significate ( p-value<0,05 ); Where: x, variables of factor 1 at level z; Xj variables of factor 2 at level j; DF: degrees of 
freedom; p-value is the probability of F ca i cu i a ted; R 2 coefficient of determination; R 2 a djusted adjusted coefficient of determination; 
AIC- Akaike Information Criterion; RCCD-Rotational Central Composite Design. Source: Self-elaboration. 


In order to identify the models that best describe the 
experiment, we observed the highest Rousted and lowest 
AIC found in Models 1 (Table 4), composed of intercept, 
linear terms, quadratics and interaction for full factorial, 3 k 
and RCCD. All effects are significant in the model but the 
nonlinear model is not well adjusted [18]. 

It is noteworthy that other models were also tested, but 
presented lower adjustments and for didactic purposes 
were not included in Tables 2 and 3. 

The representation of the three selected models (Figure 1) 
revealed that despite the reduction of 64 experimental 
units, the regression model for the 3 k design, estimated 
from thirty-six experimental units, was able to identify the 
same factors and significant effects that the control (full 
factorial-one hundred experimental units). Robust models 
are more reliable for the researcher and not affected by the 
loss of information [28]. 

However, RCCD (forty-four experimental units), although 
also did so, showed greater distance from the control. 
According to Mateus et al., [13], RCCD did not adjust well 
to agronomic data simulation experiments with coefficient 
of variation (CV) greater than 6%. The CV of this 
experiment was 1.8% and the precision of the RCCD was 
lower than the 3 k . There are indications that the estimation 


of the vertex for RCCD by the regression model of the full 
factorial may have interfered in the precision [29], 

For Mendon§a [14], who used data simulation, the loss of 
fit of the economic models can be compensated with the 
increase of repetitions of the treatments, in which the four 
replicates (full factorial, 3 k and RCCD) were maintained in 
this experiment for purposes of comparison, with the 
objective of making achievable large experiments with 
reduction of experimental units. And [29] all this is source 
of variability within the experiment and to circumvent 
these problems in the planning and conduction phase of 
the experiments is fundamental so that the experimental 
error is not high. Furthermore, the knowledge of statistical 
tests and the assumptions for their application is 
fundamental for the research to be statistically valid. 
Therefore, economic models are efficient and indicated to 
identify the significant result in initial tests or probing 
tests, especially in experiments without prior knowledge 
[14] and besides the basic requirements of the tests and 
observation of agronomic assumptions, the analysis of the 
particularities of the experiment should be observed [10]. 
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Fig.l: Response surface composed of factor 1 - days after application (daa) of corn herbicide and factor 2 - days after 
emergence (dae) of corn, variable response leaf width of corncob in millimeters, which represents the adjusted polynomial 
regression. The control (full factorial) consists of one hundred experimental units, 3 , consisting of thirty-six experimental 
units and RCCD (Rotational Central Composite Design) composed of forty-four experimental units. 


Source: Self-elaboration 

IY. CONCLUSION 

The 3 k design was presented as an economic factorial 
capable of identifying the same factors and significant 
effects in the agronomic experiments, reducing the 
experimental units and contributing to the technical and 
economic viability of larger experiments, as long as the 
number of repetitions of the treatments was maintained. 
Data simulation was not used to experience actual 
practices. 
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